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Cross-Network Functions Via Linked Hardcopy And 
Electronic Documents 

The present invention relates to functionalities of computers and networks 
of computers, and more particularly to the provision of cross-network functions via 
linked hardcopy and (digital) electronic documents. 

It is well known to provide for the linking of electronic documents by 
setting up hyperlinks between documents in HTML format stored at, and transferable 
between, computers forming nodes of a conventional computer network, e.g. internet, 
intranet. See Berners-Lee T.J., Cailliau R., & Groff J.-F., The Worldwide Web, 
Computer Networks and ISDN Systems 25 (1992), pp. 454-459. North-Holland. 

There has also been made available technology (so-called SmartPaper, 
Xerox Corp.) whereby documents sent by fax can contain, in addition to the human- 
readable information (e.g. text), printed machine-readable encoded data. Once the fax 
document is received the encoded data is decoded at the receiving fax device, and an 
appropriate additional function is performed by the receiving device. Such functions 
may include the sending of a specified document to the sending fax device. 

However, the above techniques have considerable limitations in terms of 
accessing information (e.g. multimedia information) related to a hardcopy document 
in the user's possession at a given time. The first relies entirely on conventional UI 
techniques — keying in and/or point and click — with some personal networked 
computing device; and this requires the user to divert his attention away from the 
hardcopy document (e.g. a book) which he has been reading and it requires the user 
(a) to manually enter information (e.g. a WWW URL) needed to retrieve the related 
information and (b) if he doesn't know beforehand and has not been informed in the 
hardcopy document where to access the related information, to perform some search 
in order to find it 

The second technique provides only for a limited set of information of a 
limited type (printed documents) to be retrieved and viewed. Again, the user is 
required to stop reading the document, to mark it in appropriate way, and then to feed 
it manually into a fax machine or scanner coupled to a networked computing device, 
in order for the desired function to be performed. 
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There is a need for techniques which provide in a non-disruptive 
streamlined manner related information and related functions via a user's interactions 
with a hardcopy document. 

The present invention provides a method carried out in a printing system 
for producing a plurality of imageable substrates, comprising: (a) providing a plurality 
of sheet media; (b) for each sheet medium, printing coded machine-readable markings 
on the sheet medium to produce said substrate, said markings being printed so as to 
define a plurality of zones on the substrate, each zone containing machine readable 
first markings denning a page identification code, the page identification code being 
unique for each substrate. 

Preferably, each zone further contains machine readable second markings 
defining a page location code, the page location code being unique for each zone and 
defining the position of the zone on the surface of the substrate. 

The invention further provides an imageable substrate, comprising: a sheet 
medium; coded machine-readable markings formed on the sheet medium said 
markings being formed so as to define a plurality of zones on the substrate, each zone 
containing machine readable first markings defining a page identification code, the 
page identification code being unique to the substrate. 

The invention further provides a method of printing a document, each 
page of the document being printed on a respective substrate, each substrate being in 
accordance with any of claims 6 to 10 of the appended claims, comprising, for each 
page of the document: (e) deriving print data, said print data defining human-readable 
information to be printed on the page, said human readable information comprising at 
least one active element, (f) generating first association data, said first association 
data defining a mapping between the or each active element and the page location 
code within one or more corresponding zones of the substrate, said corresponding 
zones being overlapped by or adjacent a respective active element when the active 
element has been printed, (g) storing said first association data, and (h) using said 
print data, printing said human readable information on the substrate. 

The invention further provides a printed document, each page of the 
document being printed on a respective substrate, each substrate being in accordance 
with any of claims 6 to 10 of the appended claims, the document further including 
human readable information printed on the substrate. 
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Preferably, said human readable information comprises at least one active 
element, the or each active element and the page location codes within one or more 
corresponding zones of the substrate being associated, said corresponding zones being 
overlapped by or adjacent a respective active element in the printed document. 

The invention further provides a method carried out in an information 
processing system having a processing device coupled to a network and a handheld 
manually operable and positionable graphical data input device and an information 
output device coupled to the processing device, comprising: (k) in response to the 
user positioning the graphical data input device for image capture at a location in a 
printed document according to claim 16 or 17 and a first user input being entered by 
the user via the graphical data input device, deriving first image data defining a two- 
dimensional first image, (1) using the first image data, extracting the page 
identification code from one or more zones of the substrate present with said first 
image, (m) using the page identification code obtained in step (1), generating 
instruction data, said instruction data including said page identification code and a 
request for retrieval of a corresponding electronic document, (n) transmitting said 
instruction data over the network, (o) receiving over the network an electronic 
document corresponding to said page identification code, and (p) outputting said 
electronic document using said information output device. 

Embodiments of the invention will now be described, by way of example, 
with reference to the accompanying drawings, in which: 

Figure 1 illustrates schematically the scenario, provided by the invention, 
whereby a user may access electronic documents via interaction with hardcopy 
documents; 

Figure 2 shows schematically the routing of various data, including page 
identification codes and page location codes, over a network; 

Figure 3 illustrates a sample of zones, and the disposition of machine 
readable data, on an imageable substrate in accordance with one embodiment of the 
invention; 

Figure 4 is an illustration of an example of a printed document showing its 
components in accordance with the invention; 

Figure 5 shows how digital information is encoded in the machine 
readable data illustrated in Fig. 3 ; 
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Figure 6 shows the disposition of zones on a page, and an exemplary 
zone, in accordance with one embodiment of the invention; 

Figure 7 illustrates an exemplary zone in which machine readable data is 
encoded, in another embodiment of the invention; 

Figure 8 is a schematic illustration of the internal components of a 
graphical data input device or pointer in accordance with one embodiment of the 
invention; 

Figure 9 shows a configuration for passing page identification codes and 
page location codes to the network using a graphical data input device or pointer in 
accordance with another embodiment of the invention; 

Figure 10 illustrates the capture of an image containing coded zones using 
a graphical data input device or pointer in accordance with one embodiment of the 
invention; 

Figure 11 shows a variant of the graphical data input device or pointer in 
accordance with one embodiment of the invention, i.e. integral with a writing 
implement; 

Figure 12 is a flow chart of the processing steps performed by the local 
devices in implementing an embodiment of the invention; 

Figure 13 is a flow chart of the processing steps performed by the remote 
device (at which the digital electronic document is stored) in implementing an 
embodiment of the invention; and 

Figures 14 and 15 illustrate isomorphism between physical (hardcopy) 
and electronic documents. 

Reference is made herein to 'electronic document'. It will appreciated that 
this is intended to cover not only electronically stored files which can be presented to 
the user visibly (e.g. via printing or electronic display of a page or document), but 
also to audio and audio-visual documents and any other records of information that 
can be stored and retrieved electronically and presented by appropriate means to the 
user. 
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1. System overview 
1.1 Basic architecture 

Figure 1 illustrates schematically the scenario, provided by the invention, 
whereby a user may access electronic documents, and cause functions to be 
performed at a local or remote (networked) device, via interaction with hardcopy 
documents. 

The user may manipulate (e.g. position or move with a 'wiping' action) a 
special pointer 502 (described in detail hereinbelow) to capture local images of 
portions of a document 2 and feed data derived from the captured images to the 
network (e.g. via some local device (not shown) provided with network 
communications facilities): the elements of this data are discussed in further detail 
hereinbelow. The document 2 is, or is formed from, a special substrate (discussed in 
further detail below with reference to Figs 4 to 7), hereinafter referred to as "coded 
substrates", in which visible or invisible machine readable markings defining a page 
identification code and, optionally, a page location code are formed. 

Using this infrastructure, certain functions, e.g. the presentation to the user 
via a local device (peripheral) 4 of a digital page related to the document 2, or an 
operation upon information related to or present in the document 2, can be performed. 

Figure 2 shows schematically the routing of various data, including page 
identification codes and page location codes, over a network between the main 
architectural elements of the system. 

The user can click a button (not shown) of the pointer (similar to a mouse 
click) either at a certain location on the physical page 2, or on a small sticker 2' 
(similarly formed from a coded substrate and shown at the bottom left), on the 
peripheral 4. 

The pointer 502 preferably decodes two pieces of data — the page- 
identifier (pid) and the localisation on the page (loc). These decoding actions are 
shown as dotted lines. 

The page-identifier pid (or pid") is a number which needs to be resolved 
into a network address. This is done by sending it to a router 802, whose address is 
known a priori by the pointer 502. The router 802 returns two pieces of information: 
a network address for the page-identifier, and a type which can be either "peripheral- 
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»r "digital-page". The assoctanon of pid (or pitf) ^ its ^ ^ ^ „ 
the pouiter in a cache for possible future references. 

When the type of the page-identifier is "peripheral", the pointer 502 
asstgna the network address of th. periphend 4 to its peripheral-address store The 
location too. is not used. When it is "digM-page", i, (,) assigns the net™* 
address of the digital page « to its digital-page-addreas store, and (2), i, sends the page 
locahaanon ,00 and the peripheral address to the digital page address. This is 
dtscussed further below with reference to Figure ,2. The digits, page 
dtgrta, (dtaptayabWprintabfe) representation (e.g. Web page) of the printed (human- 
readabta) infonnadon in the document 2, or may he a digital representanon of some 
document or information related to, or derivable from, the printed document 2 

When fhe digita! page (located on a server (not shown), eg a 
convendonal Weh server on me nefwo*) receives data ftom me pointer 502, i, firs, 
decodes fhe locnUsadon loc into some action which typicaldy produces oufpu, and 
men sends fhis onfpn, to the periphera, address. The digital page fhen waita for new 
loc and periph^-address data. This is discussed further hefow with reference to 
Figure 13. 

1.2 Further network considerations 
1.2.1 page-id and network routing 

Addresses can he interprets globally (simitar to toteme, addresses) or Ve-empted" 
locally by home computer 

A variant is for an entity producing coded substrates ("coded substrate 
suppher") to associate a default Web page, at some centralised site, with any™--,* 
number it produce, The buyer of a coded substrate page ipso facto becomes the 
owner of this centralised Web page. He may have only limited rights as to using this 
page (for memory usage considerations), as for instance only the right to memorise in 
« the net address of bis own Web page, which contains the detailed data The 
centrahsed Web page provided by coded substrate supplier acts only as . W 
between" between the pag e-Ui and its associated digital page. The interest of this 
scheme is that mom anywhere in the world, the page-Ui can be routed to its 
corresponding centrahsed Web page through a generic routing scheme 
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Of course, optimisations of this scheme should be found, which do not require 
the owner of an address book living in Paris to route each page-id first through the 
headquarters of the coded substrate supplier in Palo Alto then back to his home 
computer in Paris. 

Another optional feature is caching network path associated to & page-id for 
access efficiency. 

12.2 Wireless communication, ubiquitous peripherals 

Functions that the communication infrastructure may provide include: 
allowing the pointer to send the page-id code to a predefined address router, 

which is able to determine the Internet address of the digital page corresponding to 

this page-id , 

getting back (and possibly caching for future use) this page Internet address, 
(possibly) sending a page-id-code to the page Internet address, 
sending pointer-loc codes to the page Internet address, 

sending to the page Internet address the Internet address (or phone number, 
etc.) of the peripheral to which output should be sent, 

sending back to this peripheral the output produced by the pointer-loc 
interpretation process associated to the digital page. 

2. Encoding data on substrates 

Current or shortly available printing technology, conservatively estimated, 
permits to print on the order of 1 000 bits of reliable information per square centimetre 
(according to recent public releases, the Glyph™ technology developed by Xerox 
currently permits to encode on the order of 500 bits/cm2, including a robust error 
correction scheme, and assuming printing and scanning at 400 dpi. Considering the 
special constraints placed on Glyph™ (unobstructiveness to human visual 
perception), which are not shared by our invisible marks, and also keeping in mind 
that the production of coded substrate will be a process carried out by specialised 
machines, our assumption of 1000 bits/cm2 seems reasonable). 

A recent CACM article about address protocols currently being devised 
for the Internet notes that 128 bits theoretically permits to address 3.4 x 1038 items, 
or 6.7 x 1023 items per square meter of the surface of the Earth. Because in reality 
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"the assignment and routing of addresses requires the creation of hierarchies that 
reduces the efficiency of the usage of the address space", these figures should be 
lowered somewhat in practice, a pessimistic estimate being 8 x 1017 nodes (1564 
addresses per square meter of the Earth), an optimistic one 2 x 1033 nodes (4 x 1018 
addresses per square meter of the Earth). 

It can be safely assumed that the data zone in each cell holds 256 bits of 
reliable information. Of these, 128 bits hold the page-id (which is thus redundantly 
repeated on each cell of the page), 32 bits hold the cell localisation (or cell address) 
on the page (this corresponds to a maximum number of 65536 cells on a page, this 
corresponding to a 123cm x 123cm maximum page). 224 bits are left for the page-id- 
code (and possibly other information, such as publisher's private data. 

We will now assume that the pointer device has the shape of a pen, and 
that its tip, when positioned anywhere in a given cell is able to "see" the whole cell. 
By decoding the cell address, the pointer knows the localisation of the cell on the 
page. By seeing the cell border, the pointer is also able to determine its precise 
position inside the cell. Thus the pointer is able to localise itself precisely relative to 
the whole page. The pointer is also able to read the page-id and the page-id-code and 
therefore to send these two, along with its page localisation, for interpretation via the 
external infrastructure. 

The code scheme we have just described is but one of several possibilities. 
2.1 Invisible code marking 

Let us now turn to another point: how does the coded substrate supplier 
produce the just described codes as invisible marks on paper, and how these marks 
can then be recovered by the pointer device. 

One option is the use of ultra-violet ink to print the invisible codes on 
white paper. Such an ink looks transparent to the human eye, yet an area of paper 
covered by it reflect UV light differently from an uncovered area. 

In this case, the coded substrate supplier produces sheet of paper in 
different formats for uses by the publishing industry. Each sheet is processed through 
a specialised printing procedure which (1) assigns a fresh page-id (and possibly page- 
id-code) to this sheet, and (2) prints in UV ink the corresponding data cells on the 
surface of the sheet. 
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The publisher buys these apparently uniformly white sheet and now 
prints, in standard ink, visible marks on them. 

The end user positions the pointer device on the page. The pointer emits 
UV light and reads back the UV light reflected by the UV-ink layer (rather than 
emitting UV light, the pointer might also exploit the UV component of natural 
"white" light). This light, apart from being invisible to the human eye, is similar to 
conventional light and "standard" techniques for optical scanning can be applied to 
recover the UV-ink data. 

In certain cases it may be necessary to use different inks for code marking 
to avoid problems of the conventional ink layers blocking the transmission of the UV 
light. However, in standard trichromatic printing, three layers of coloured inks 
(yellow, magenta, and cyan) are superposed to produce all visible colours (in fact, for 
practical reasons, quadrichromatic printing, where black is the fourth "colour", is 
more often used than trichromatic printing, but let's ignore this here). Each layer 
filters one part of the light spectrum independently of the others. That is, the magenta 
layer does not block light from the cyan and yellow layers, etc. 

Other to UV ink are infrared ink, phosphorescent ink (phosphorescence is 
used in TV screens) or some type of magnetic ink (which has been proposed in the 
past for bar-code type scanning). It should also be remembered that it is not essential 
that the codes be encoded through optical means. The only crucial properties are that 
they be durable, do not interfere with the visible printed marks, and permit to recover 
page-id, page-id-code and pointer localisation. 

Figure 3 illustrates a sample of zones, and the disposition of machine 
readable data, on an imageable substrate in accordance with one embodiment of the 
invention: this shows substrate markings consisting of four elements — 

cell borders, which must be distinguishable from cell contents 
some indication of the orientation of the page 
page-id, represented within the cell border 
localisation information, represented within the cell border. 
Each zone or cell 202 includes a border 204 and an orientation marker 
206. A first set of markings 208 over part of the interior of the cell 202 are encoded 
representations of the page-id, while a second set of markings 210 over a (smaller) 
part of the interior of the cell 202 are encoded representations of the localisation 
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(page-Ioc) - BApriy ^ position of ^ ^ ^ ^ ^ ^ 

onentetton of the page _ ^ by . ^ fa one ^ rf ^ ^ ^ 
asymmetric in both dimensions. The eells are preferably „,ed to cover the enure page 
In F.g. 3, the page-id and localisation are shown sehematioally; detail is 
given in Fig. 5, discussed further hereinafter. 

Figure 4 illustrates the components of a printed document as printed on a 
coded substrate. The printed document 102 comprises a layer .04 of printed visible 
ftuman-readable) inmrmarion (document content) printed on a coded substrate 106 
The coded substrate ,06 in mm comprises a Uyer .08 o, visiMe or invisibte machine 
readabm markings (encoding the page-id and .ocahsation,. printed on a she* medium 
110 (e.g. paper). 

FigUKS 5A ^ *gi« information is encoded in me machine 

readable dam Otoarated in Fig. 3, i.e. the ceu or zone contents. Fig. 3 shows the 
bmary dam, i.e. 47 bite of page-id in me upper section 302 (the bit stieam wraps a. the 
cel. border 204), ami ,6 its of page .ocansation darn (loc) in m. lower section 304) 
The paged cod. denotes 1 08429 159095492 = 629DA182DCC4 (hexadecimal) = 
1 lOOOlOlOOmOllOlOOOOI lOOOOOlOl lOUlOOl 1000100000.000000010.0. 
(bunny; to make the wrapping expUcit). Wittun me border of a cell, me page-id is 
ranted as an ordinal number, in binary (Fig. 5A). The ceU is 8 by 8 symbols (or 
* »s) wtde, W m one aymbora worth of space teken up by me orientation mark, to the 
16 b« loc code in section 304, mere are 8 bite for the X co-ordinate and 8 bite for the 
Y co-ordmate. Thus, for the ceU (zone) shown, its position is 16^> on the substrate 
Locauaation identiSea a cell (zone) on the page, no. a point on the page. 

With mis encoding scheme, a complete ceU mus, be within the region of 
*e page from which page-id and .oca.isa.ion are to be removed. Less than a 
complete cell does not contain sufficient information. 

The addition of error correction information is desirable. This can be 
done with standard technics and is no, shown; me acmal encoding after the addition 
of etror correction information wiU be different ftom what is shown, and will «,uire 
more bits (and thus more space). 

The number of possible pages and the number of possible cells on a page 
are a mnction of the density of the encoding. Using the encoding shown, 2 to the 
47th pages can be identified, with up to 64 by 64 cells on each. 
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Fig. 5B shows the same data as in Fig. 5 A, but represented by Data Glyph 
markings. Encoding using data glyphs and the retrieval of data therefrom is discussed 
further in US-A-5,486,686, EP-A-469864, and the abovementioned GB application 
(ref.R/98003/JDR). Here, there is a first set of glyphs (markings) in upper section 402 
and a second set in lower section 404, the two sets of glyphs being encoded 
representations of page-id and loc codes. DataGlyphs offer advantages in robust 
decoding. 

In accordance with the invention, the page-id, page-loc and any other data 
can be encoded on the coded substrate in any other suitable manner. For example, the 
encoding may be as described in any of: U.S patents 4,786,940; 5,245,165; 5,278,400; 
5,315,098; 5,449,895; 5,521,372; 5,537,223 ; 5,572,010 and 5,576,532. 

The inks used to encode the abovementioned data may be visible or 
invisible, and may be detected by visible optical, IR, UV, magnetic or any other 
suitable means. For example, suitable inks for the markings encoding the data may be 
as described in: U.S patents: 5,075,186; 5,225,900; 5,301,044; 5,145,518; 5,046,087; 
5,275,647; 5,208,630; 5,202,265; 5,271,764; 5,256,193; 5,286,286; 5,331,140; or 
5,291,243. 

Figure 6 shows the disposition of zones on a page, and an exemplary 
zone, in accordance with one embodiment of the invention. 

Figure 7 illustrates an exemplary zone in which machine readable data is 
encoded, in another embodiment of the invention. Here, a paper page is divided into 
0.5 cm x 0.5 cm square cells. Each such cell can contain on the order of 256 bits of 
information. The cell is identifiable by its distinctive border, consisting of 
uninterrupted horizontal and vertical lines of black pixels. A square of pixels, which 
is separated from the black border by a white area one pixel wide, holds the data (in 
the illustration this square is shown 10 pixels x 10 pixels wide, but in reality it would 
have a higher resolution). 

3. User-positionable pointer/scanner 

This device may take the form of a pen-shaped pointer, provided with 
clicker. Additionally/ alternatively the pointer may include a small transparent screen, 
rimmed with a code-reading device, with a cross hair in the middle: this may be a less 
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costly to ta ^ nearer . temi ^ a pen _ shaped pointerj ^ ^ ^ 

of the surface of the page at one time. 

Figure 8 is a schematic illustration of the internal components of a 
data input device or pointer in accordance with one embodLnt of L 
— pointer is a complex object that ma y he a single physical device o 
may be several communicating devices. It contains 

1. A hand-held image-reading device, containing a button. 

2. An image-decoding device. 

3. A network communication device. 

The image-reading device ^ md 

the network communication device as described m Fig . , 2 . * 
elemem* „. FiSUre 8 *■** * *" M PU ^ M **« *— * and the 

^coding software, nerivoric comnumicauon software, and con J 
oftwa^d network common hatdwam 8I 0. (The ftmcdonamies of softwam can 
b« reproduce « na J dwa re if** „ ^ fc 

*r . digital-pag. netwolk ^ md m putpu , periptea , ^ 
- . cache comprising a se, of «p te < pid , ^ ' 
dtsbngutsbes me pidofa digital page from «ha, of a peripheral. 

The image-reading device can be realised as a pen-camera or hand-held 
-ner („ CCD Contents with ^ „ ^ ^ ^ ^ J" 
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The function of a frame grabber is to capture a snapshot in memory from 
the image data that the camera continuously produces. If the camera is not integrated 
with or tethered to the image decoding device, both devices must have wireless 
communication facilities to allow the camera to transit the image data to the frame 
grabber. (There are performance considerations, because image data is large.) The 
button click also must be communicated, as it resides on the camera for ergonomic 
reasons, but activates the frame grabber. 

The snapshot is input to the image decoding device, whose output is the 
<pid, loo pair. If the image-decoding device is not integrated with the network 
communication device, the <pid, loo pair must be transmitted to the network 
communication device. Again, this requires each device to have communication 
facilities (a cord, or some form of wireless communication such as infra-red, or 
mobile telephony). 

An alternate description of the pointer is shown in Fig. 9: this shows a 
configuration for passing page identification codes and/or page location codes from 
the pointer of 11 to a network computer, in accordance with an embodiment of the 
invention. The image capture device (e.g. CCD camera) 506 is coupled by wired or 
wireless (e.g. IR or RF) link to processing device 602 and in use provides image data 
defining capture images to the processing device 602. The operative elements of the 
processing device 602 are a frame grabber circuit 604, image decoding software 606, 
and a CPU 608, which are known in the art. (In certain embodiments, the camera 506 
and processing device 602 may be* combined into an integral handheld unit; Fig. 8). 
In use, the processing device 602 extracts from the image data the corresponding 
page-id and page-location data (<pid, loo) and communicates them in a wired or 
wireless fashion to a local device (here, a network computer 610, which is linked to 
the network (intranet, internet) in a known manner). The computer 610 has its own 
unique network address, but need not have any information output device (e.g. display 
screen, printer). 

Here it is the network computer that has the cache and the digital-page 
network address and output-peripheral network address storage. As the network 
computer need not have a screen, it does not have to be a large device. 

Figure 10 illustrates the capture of an image containing coded zones 202 
using a graphical data input device or pointer in accordance with one embodiment of 
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embodiments a communication protocol and infrastructure, including the need for 
widely available wireless communication between pointers, peripherals, and 
computers. 

These requirements may be relaxed at the cost of a tighter coupling of 
coded substrates to peripherals. For example: 

• Shakespeare's play Othello could be published, with similar 
capabilities, as a book/CD/pointer/software package for use with a computer 
containing a CD player. If it is assumed that a CD player that can interact with a 
monitor, it could be published as a book/CD/pointer package, where the software is 
built into the pointer (which controls the CD player). 

• If it is assumed that wireless telephony is available and the pointer can 
contain a telephone, an coded substrate sales catalogue could allow a user (called, say, 
Ariane) to place an order by clicking on an item. The catalogue provider's phone 
number could be encoded in the substrate along with the page-id and pointer location 
information. Clicking on an item then initiates a call to the encoded number. The 
page-id and pointer location are thereby transmitted to the provider. These identify 
both the item and the customer, allowing order to be placed. 



5.2 Extensions 

5.2.1 Extensions of the pointer device: 

• Sensitive transparent screen overlaid on a coded substrate. Here, a 
flexible, transparent screen made of plastic (or rubber, etc.) is provided, in one (or 
several) of its comers, with a flat coded substrate code reading device, and which 
when touched with a finger, is able to determine the location of the finger relative to 
itself. Such a screen can for instance serve as a universal keyboard: overlay it over a 
coded substrate on which a keyboard has been drawn. The user "types" at a certain 
location, and the screen "knows" what this location is, not only relative to its own co- 
ordinates, but also (because of the information read at the corners) relative to the 
intrinsic paper co-ordinates. The infrastructure can then determine that this is the 
location of letter A. 

Such transparent screens could be more practical than a pointer when 
several fingers act in parallel, like when typing on a keyboard. This also makes 
possible implementing some "paper-based" musical instruments on this principle. 
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if the device has a small "footstep" (active scanning surface), because the images 
scanned in over several moves will be easy to connect into a global picture. 

5.2.2 Extensions of the paper substrate: 

• Coded Confetti. These are small round stickers made from a coded 
substrate. In particular, each sticker has a unique page-id. They can be used to add 
multimedia links to a piece of conventional paper. The user buys a few of these coded 
confetti. He is perusing a conventional document, printed on standard paper, and 
wants to associate a certain action to a certain location on a given page of this 
document (this action could be to go to a certain Web site, to ring Mary's phone 
number, to print an address on an envelope). He sticks the confetti (which could be 
coloured or transparent) at the location of interest and associates (through his personal 
computer, or through a service provided by the confetti provider) this action with the 
confetti's page-id. 

Coded Confetti can be seen as a transition technology that permits to 
confer to conventional documents some of the advantages of CODED SUBSTRATE 
documents. Obviously these confetti could be stuck on any support, not only printed 
material. A confetti stuck to a machine part could have the effect of showing a 
description of this piece on a screen, when pointed to. 

5.2.3 Extensions of the coded substrate document architecture: 
• Interdocument communication. In the embodiments previously 
discussed, it has generally been assumed that every desirable capability of a coded 
substrate document is provided by its publisher independently of the existence of 
other coded substrate documents. Here, it is briefly described how communication 
can be established among coded substrate documents produced by different 
publishers, increasing overall document functionality, on the basis of two examples. 

• Computerless email: Ariane wants to send a message to her friend 
Irene. She writes the message, using a writer-pointer (Fig. 1 1), on NotePaper, a pad of 
coded substrates designed for sending notes. She then looks up Irene in her address 
book, itself an coded substrate document. She clicks the pointer on a Print icon in 
Irene's listing. She then clicks on a Send icon on the sheet of NotePaper containing 
the message. The NotePaper application sends the image of that sheet of NotePaper to 
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a wide range of coded substrate documents, and for which support can be provided in 
a generic manner; several of these are described here. 

To reiterate, a digital page receives positions (x, y co-ordinates) and produces 
output (which may take the form of instructions to run a program on another machine, 
and input data for that program). The manner in which the co-ordinates are delivered 
to the digital page is not of consequence here, nor is the manner that the output is 
delivered to the machine on which it is to be displayed or executed; those elements 
are described elsewhere. 

A basic coded substrate capability is to initiate an action by clicking in an 
active region on the page (which is typically graphically marked, as with icons, 
labelled boxes, or highlighted words). Active regions have two elements: regions and 
actions. 

6.1.2 Regions 

A digital page receives the position of the pointer, and must determine if that 
position is within an active region. To this end, it must have a description of the 
active regions on the page. 

Rectangular regions have a particularly simple description. Given two 
diagonally opposing corners of a rectangle, you can tell whether a given point lies 
within it or not, and the function used to tell is also particularly simple. 

Arbitrarily complex regions can be described in such a manner that you can 
tell whether a given point lies within it. The function needed to decide may be 
complex, and different types of descriptions require different functions. We will call 
a pair consisting of a description of a region and a function to decide whether a point 
lies in it a "region". 

6.1.3 Actions 

Once the digital page has determined that a position is within an active region, 
it must cause an action (or a number of actions) to be executed. It therefore must 
have a description of the action(s) for that region. Possible ways to describe actions 
include (but aren't limited to) programs (or functions), with or without input, pointers 
to programs (internal or external to the digital page), with or without input, and type, 
data pairs, where the type element identifies a conventional action to take, and the 
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execute a program on another machine. In the case of coded substrates, the peripheral 
near the user is registered with the communication infrastructure. Either as part of the 
registration process or explicitly by the user at the point of interacting with a coded 
substrate document, an output player is launched on the peripheral. The digital page 
can send data to the output player, whose address is known to it by virtue of the 
registration; it need not launch a program on the peripheral. The output player, in 
contrast, can launch programs on the peripheral (subject to local permissions). 

6.1.6 Communication with Applications 

In the course of executing actions, a digital page may launch other 
applications. When a digital page launches an application with which it may have 
continuing interaction, it maintains a connection with the application (such as a 
socket). The digital page may launch an application (for example, a browser) on the 
user's peripheral via the output player. The output player returns to the digital page 
the information it needs (such as a socket name) to establish a direct connection with 
the application. 

A digital page may also provide a mechanism to allow the user to establish a 
connection between itself and an already-running application. 

6.2 Output Actions 

Here we describe one way to implement the output actions attributed to user 
(hereafter called Ariane) herein. A variety of other possibilities exist. 

6.2.1 Action 1: Text Output 

Ariane is in doubt about the meaning of a word. She clicks on it; an 
explanatory note for the word is displayed on her TV screen. 

The digital page is responsible for producing the explanatory note; how it gets 
to Ariane's TV screen is the responsibility of the communication infrastructure. 

The active action table contains a pair W, E, where W is the region containing 
the word, and E is, for example Output, Exp, where Output is a type meaning 
"display the data element of the action pair to the user" and Exp is the text of the 
explanation. 
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trigger location of the point in the reading, the audio data from there to the end of the 
segment could be output. 

6.2.3 Action 3: Video Output 

Ariane clicks on the PLAY SCENE box, then on the beginning of a scene, and 
a video of Orson Welles* interpretation of the scene is played on her TV. 

This is just like the previous case, but the action table for the PLAY SCENE 
box has entries appropriate to playing videos of scenes. 

6.3 Annotating; Adding Actions 

The example is the following: Ariane has previously connected to the Web 
pages provided with the coded substrate document that she's reading (Othello), by 
clicking on the MAIN WEB PAGE box. As a result, she has a browser on her screen, 
open to one of these Web pages. She uses a normal pencil to make a note to herself 
on the page. She clicks on the LINK TO WEB PAGE box. She clicks on the note 
she just wrote. Henceforth, clicking on her note causes the Web page to pop up on 
the screen. 

A similar but slightly more general example is easier to illustrate, after which 
we'll return to the above. Consider: Ariane has a browser on her screen, open to an 
arbitrary Web page. She uses a normal pencil to make a note to herself on the page. 
She clicks on the LINK TO WEB PAGE box. She clicks on the note she just wrote. 
A dialog box pops up on the screen. She copies the URL of the Web page from the 
browser to the dialog box and clicks "Apply" in the dialog box. Henceforth, clicking 
on her note causes the Web page to pop up on the screen. 

The active action table contains an entry for the LINK TO WEB PAGE box. 
Its first action changes the mode of the digital page. Rather than continuing in its 
normal event-driven mode, the digital page waits for the next click from the user. 
When it's received, the position P of the pointer is recorded, rather than being 
interpreted as a standard interaction (it is not looked up in the active action table, as 
would normally be the case). A region R of some conventional size with P at its 
centre is created. The above-described dialog box interaction is initiated, and the 
URL U given by the user is used as the data element of an action URL, U. The entry 
R, URL, U is added to the active action table. When the user subsequently clicks in 
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How It's Done. The active action table contains an entry for the MOUSE 
MODE box. Its actions follow: A command is sent to the pointer, to change its mode 
of operation. In the new mode, the pointer initiates communication every n timer 
ticks, sending the digital page its position each time, in addition to sending its 
position with a click event when the user clicks. [This change of mode, and thus the 
need for the pointer to receive control information fiom the digital page, can be 
avoided at the cost of more communication traffic, by having the pointer always send 
its position every n ticks. In non-MOUSE-MODE, the digital page would ignore 
positional information that is not accompanied by a click event. Alternatively, the 
pointer could distinguish itself whether it's in mouse-mode or non-mouse mode (e.g., 
the button is held down during mouse mode, or alternatively, positions are sent everji 
n timer ticks when the pointer is moved in contact with the page).] 

The mode of the digital page process is changed, too. The first position RP 
that the digital page receives with no accompanying mouse click is recorded. A cursor 
is displayed on the output screen, [cursor display is one of the capabilities of the 
output player] and its position CP on the screen is recorded. Each time a position P is 
received, the difference between P and RP is computed. That difference is added to 
CP and the cursor is displayed at the new value of CP. RP is replaced by P, and the 
process iterates. 

When the digital page receives a position P with a mouse click event, it 
updates CP and RP as usual, and in addition it forwards the mouse click event, with 
CP substituted for P, to the application. By "forwards the mouse click event" we 
mean that the digital page puts the same type of mouse click event (changing the 
position) in the application's event stream. 

Of course, if the application is the digital page itself, these issues are absent. 

Absolute Positioning. What has just been described is how to use positional 
information relative to the printed page frame to simulate a standard mouse. As a 
positioning interface, this has the advantage of being familiar to mouse users. 

This is relative positioning: the user moves the pointer relative to the current 
position of the cursor, and the absolute position of the pointer on the page is 
irrelevant. 
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The digital page uses this action table whenever it receives double clicks. [It is 
assumed that the digital page can distinguish double clicks as such. This may require 
sending a time-stamp with the click event, as transmission times from the pointer to 
the digital page may not be uniform. Or the pointer could recognise the click type.] 

Suppose now we add another selection method, triple clicking to select a line. 
Clearly we can add an action table with an entry for each line. 

Finally, let's consider another common text selection method: pushing the 
button down, dragging the pointer, and releasing the button elsewhere. The digital 
page receives the position of the pointer at the two locations. 

This selection method is not efficiently amenable to an action-table 
implementation (without a separate representation of content and a mapping from 
positions into that representation). An implementation analogous to the ones just 
described would require entries for all pairs of characters. 

We turn to the example to illustrate the concept of selecting content. The 
example, in which Ariane wants to have the word "exsufflicate" translated into 
French, is as follows: 

She clicks on the box TRANSLATIONS on the top of the physical page. A 
special menu appears on the screen, giving a choice of several languages. She selects 
French. She now has a choice between either a word-in-context automatic translation 
or a number of standard literary translations. She selects Gide's translation, then 
directs the pointer to the difficult word. Gide's translation of the line appears on the 
screen, with the word(s) corresponding to "exsufflicate" singled out in boldface. 

Note that the entire line is translated, though the user pointed to a single word. 
This can be done with action tables as follows: The TRANSLATIONS box 
has an entry in the active action table, or its region is type Action; either way the 
action puts up the menus and collects the user's decisions. The digital page has access 
to Gide's translation, which is aligned sentence-for-sentence, and within each 
sentence, word for word, with Ariane's copy of Othello. Given a sentence number 
and the number of a word within the sentence, the translation function finds the 
corresponding sentence and word (or words) in the translated text. It boldfaces the 
word and returns the translation. 
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delays) and in parallel. Button clicks on the pointer are treated as clicks on the 
mouse. The digital page process acts just as it would if the user were moving the 
cursor, and clicking, with the mouse. 

Given this kind of parallelism, standard computer application techniques (such as 
those used in editors and browsers) may be used to implement Ariane's actions. 
Provision of support for those actions can be made using techniques known in the art. 

This parallelism can be achieved by associating the digital page process with a 
special purpose window manager. This window manager manages the window 
displaying the page. The window size is identical to the page size. The window 
manager takes positional information from the pointer rather than the mouse-but this 
is transparent to the digital page process. 

There is no need for this window to be rendered to a screen—it can be entirely 

virtual. 

The isomorphism is between the image on the physical page and one aspect of 
the digital page: a digital representation of that image (or information from which 
such a representation can be constructed). The digital page contains more 
information, including the mapping from the image to the content and the actions to 
be performed. 

Constructing a digital representation that looks like what you get when you 
print to paper was a difficult problem-most text editors and browsers display a 
document that differs on-screen to one degree or another from the printed version. 
Adobe PDF format provides page-description functionalities that are strong enough 
that the printed and digital representations truly look alike. Other page description 
fonnats may provide similarly. 

A special-purpose window manager is just one way among many to take 
advantage of such an isomorphism. [Another way would be to use a standard window 
manager, but to modify the positional and event information as it comes from the 
pointer so that it appears to the window manager to be a mouse event. This moves 
some of the special-purpose processing, and some elements of window-management, 
into the communication infrastructure, which would communicate with the window 
manager. Alternatively, positions can be interpreted directly by the digital page 
process. Window management per se (which window is on top, which application 
handles an event) isn't relevant. 
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Consider the representative case of a classical recording publishing 
company, Digital Grammophon, or DG; this company distributes musical recordings 
over the net, and it must find ways to have listeners pay for them, coded substrate can 
help to solve this problem. 

Consider the following scenario. DG sells, through record or book shops, 
"listening cards" printed on coded substrate. One such card, for instance, lists the last 
six Mozart symphonies. Each movement is listed as a subheading under the 
corresponding symphony heading. The customer buys the card, goes back home, 
clicks on the heading for the 40th, and the music is played for him on his hi-fi 
peripheral device. 

Is this scheme commercially secure for the DG company with respect to 
an unscrupulous user intent on defeating copyrights? 

• Such a user might legally buy a listening card for the symphonies and 
reproduce, through standard photocopying, its printed content on thousands of cards, 
and distribute these through parallel channels. Of course, this scheme will not work. 
A buyer of such a card will not be able to use it in any way. It is instructive to 
compare this situation with the case where the cards produced by DG would be 
printed on conventional paper, and where marks printed on these cards, such as 
barcodes, would encode the addresses of the music tracks. In this case, a standard 
photocopy of the original card would be functionally indistinguishable from this 
original card. 

• Such a user might become more sophisticated and produce perfect 
copies of the original card, including the coded substrate (an operation, which, by the 
way, may be illegal if coded substrates were a monopoly of a coded substrate 
supplier). 

Although such an operation would be technically difficult and costly, it 
might not be impossible. However, DG can easily prevent such a problem. Each piece 
of coded substrate has a unique identification number page-id, which is associated at 
the DG site with the digital representation of the page (this is of course a many-to-one 
mapping). Because of this unique identification for each physical card, DG can detect 
two simultaneous access attempts through the same page-id, and knows that this event 
corresponds to an illegal situation. It can then take measures of different degrees of 
severity. A drastic measure would be to void the guilty page-id of any future access 
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code numbers associated with these pages. The only way she could know these 
numbers would be by buying all the DG cards corresponding to the symphonies, a 
somewhat uncongenial prospect. 

We thus see that, while coded substrates may still be copiable, given the 
aforementioned additional technology, with a page-identifier coding scheme, it 
becomes unforgeable: in order to produce a usable card, the pirate needs to own' an 
original for that card. 

It should be noted here that the idea of privately encoding the page 
identifiers does not strictly require the coded substrate medium. It could also be 
implemented via conventional printed marks on the card. Although this might already 
be an interesting possibility in itself, the coded substrate medium fits the bill very 
well here, because one generic architecture (paper, pointer, net infrastructure), already 
employed in several other functions, can be readily adapted to the case at hand The 
only modification of the original scheme consists in adding a page-id-code to the 
page-id during the production phase at the coded substrate supplier, an operation with 
minimal supplementary cost 

Another access rights example is as follows. The user buys stamp at local 
news-stand, transfers its code to mail-order house, mail-order house transfers code to 
bank 

Suppose the coded substrate supplier, under the supervision of a 
commercial bank, produces small pieces of adhesive coded substrates, shaped like 
post stamps, and printed in various denominations (10cts,l$,10$, etc.). These stamps 
are sold, under a sealed wrapper, at the bank offices, through local news-stands etc 
Using coded substrate technology, the stamps are provided with a unique page-id and 
with a corresponding page-id-code (see previous section). The pair <page-id .page- 
id-code > is known only to the bank. 

A customer wishes to purchase some catalogue item through a mail-order 
company. She buys enough stamps to cover her order, sticks them on an empty space 
provided next to the item in the catalogue, and then, using the pointer, clicks first on 
the item, then on each of the stamps. The action of clicking on a stamp has the effect 
of transferring the <page-id , P age-id-code > pair to the mail-order company. The 
mail-order company can then transfer these codes to the emitting bank, which credits 
the mail-order company's account for an amount equal to the stamp denomination. 

Substitute Sheet (Rule 26) 



WO 99/50787 

34 PCT/US58/2C59- 
The b** wiu ^ „ operatiott ^ a ^ ^ ^ 

once: any attempt to ^ ^ pajr &r , late ^ > 2 

-P «- be used for payment on!y once ta its ^ ^ ~ 

Tie procedure jus, sketched oeeds to be compared with other forms of 

sm ~ y is 1,16 **• """» ^ — ~ 

buyer of stamps - can be provided relatively easily. 



Substitute Sheet (Rule 26) 



WO 99/50787 
CLAIMS 



35 



PCT/US98/20597 



1 • A method for producing identifiable sheets comprising: 

(A) obtaining sheet medium; and 

(B) producing one or more identifiable sheets from the sheet medium; 
in which (B) comprises: 

(Bl) printing machine-readable markmgs within one or more sheet areas of the 
sheet medium, each sheet area being an area of the sheet medium from which one of 
the identifiable sheets is produced; the markings within each sheet area defining two 
or more zones of the sheet area; the markings within the zones of each sheet area 
indicating a sheet identifier; the sheet identifier identifying the identifiable sheet 
produced from the sheet area. 

2. The method of claim 1 in which the markings within each zone of each sheet 
area further indicate a position of the zone within the sheet area. 



3. The method of claim 2 in which (Bl) comprises printing, within each zone, 
sheet identifier markings indicating the sheet identifier and position markings 
indicating the position. 



4. The method of claim 1 in which (Bl) comprises printing the machine-readable 
markings with an invisible ink. 

5. An identifiable sheet comprising: 
an area of sheet medium; and 

machine-readable markings within the area of sheet medium; the markings within the 
area of sheet medium defining two or more zones of the area of sheet medium; the 
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the sheet identifier identifying the identifiable sheet. 

«. Tie identifiable sheet of data 5 in which the markings within eaei zone of 
the sheet area further indicate a position of the zone within the sheet area. 

7. ^i'fc^lesheetofclaimoinwmchthemarldngawi.hm^zone 
aclndeaheetidaotifier markings indicating the sheet identifier and position nttrkings 
indicating the position. 

«. T^^^ofclamtStawm^^ 

invisible. 6 
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